, CRF Errors Corrected by th STIC Systems BraMi o / / ^ 

/n/n9^C^7 CRF Pfoc^ng Date: 3 //a /^Q^ : 

Sertef Number: /O/ V /j)^ ' _^ ^ Edited by: \ /> > — 

rhnnned a file from noitA^CIIto ASCII C A| TCr _ V rlllodby: ^ilii^ (STIc;^aff 



Q Changed a file from noivASCn to AS^^^^ £ Hijedby: /. M^ (STIc;^aff) 

Q] Changed the margins in cases where the sequence text was \vrappw" down to the nert line. 
Q Edted a format error In the CuirenJ/pplication Data sectio^^ =^ . 

I 1 Edited the Current Application Data section wHh the actual current number. The number Inputted by the 
applicant was □ the prior application data; or O other ■ 

rn Added the mandatoiy heading and subheadings for -Cun-ent Application Data". 

I j Edited the "Number of Sequences" field. The applicant spelled out a number instead of using an integer. 

I j Changed the spelling of a mandatoiy field (the headings or subheadings), specifically: 

I I Corrected the SEQ ID NO when obviously incorrect. The sequ,ence numbers that were edited we/e: 

I ] Inserted or corrected a nudeic number at the end of a nucleic line. SEQ ID NQs edited: 

r~] Corrected subheading placement. All responses must be on the same line as each subheading, irthe 
applicant placed a response below the subheading, this was moved to its appropriate place. 

I j Inserted colons after headings/subheadings. Headings edited included: . 

, | — ^ Deleted extra, invalid, headings used by an applicant, spedficaJly: 

Deleted: ^i^o(\-ASCl\ "garbage", at the beginning/end of files; □ secretary initials/filename at end of file; 
□ page numbers throughout text; □ other invalid text such as..^^ , , -. ; ^ . 



I I Inserted mandatory heacfings, spedgcally : . ' 

I i Corrected an obvious en-or in the response, spedfically: 



I I Edited ideDtifiQO where upper case is used biit lower case is required, or vice versa. 

rn ^ Con-ected an error in the Number of Sequences field, specifically: 

|~] A "Hard Page Break" code was inserted by the applicant All ocoirrenceij had to be deleted 

[~~| Dereted onding stop codon in amino add sequences and adjusted the "{A)Length:" field accordingly (error 
Yfiiftfna PatanHn hiij)- SAqii«Mif»grftrreded: _ . . 



r~| OHien 



*Examin r: Th abbv corrections must bo communicated to th applicant In tho first Office 
Action. DO NOT send a copy of this form. ^^'^^ 
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C--> 
C--> 



4 
5 
6 
7 
9 
10 
12 
14 
14 
14 
15 
17 
19 
21 
22 
23 
24 
26 
27 
28 
30 
31 
32 
33 



<110> APPLICANT: Olandt, Peter J. 
Meyers, Rachel E. 
Galvin, Katherine a; 
Millennium Pharmaceuticals Inc . 
<120> TITLE OF INVENTION: 33945, A Human Glycosyltransf erase and 

Uses Therefor 
<130> FILE REFERENCE: MPI2001- 018P1RCP1 (M) 
<140> CURRENT APPLICATION NUMBER: US/10/074,527 
<141> CURRENT FILING DATE: 2002-02-12 
<150>. PRIOR APPLICATION NUMBER: 60/269202 
<151> PRIOR FILING DATE: 2001-02-15 
<160> NUMBER OF SEQ ID NOS : 9 . 
<170> SOFTWARE: FastSEQ for Windows Version 4.0 
<210>. SEQ ID NO: 1 
<211> LENGTH: 2850 
<212> TYPE: DNA 
<213> ORGANISM 
<220> FEATURE: 
<221> NAME/KEY 
<222> LOCATION 
<400> SEQUENCE 



homo sapiens 



CDS 

(81) 

1 



(1826) 



ccggctcggt accactataa cggccgccag tgtgctggaa ttcgcccttg cgcagatcgc 60 
tggctgcagt tggcgggcgc atg tgg ggg cgc acg gcg egg egg cgc tgc ecg 113 

Met Trp Gly Arg Thr Ala Arg Arg Arg Cys Pro 



161 



34 














1 








5 










10 


36 


egg 


gaa 


ctg 


egg 


cgc 


ggc 


egg 


gag 


gcg 


ctg 


ttg 


gtg 


etc 


ctg 


gcg 


eta 


37 


Arg 


Glu 


Leu 


Arg 


Arg 


Gly 


Arg 


Glu 


Ala 


Leu 


Leu 


Val 


Leu 


Leu 


Ala 


Leu 


38 








15 










20 










25 






40 


ctg 


gcg 


ttg 


gcc 


ggg 


ctg 


ggc 


teg 


gtg 


ctg 


egg 


gcg 


cag 


cgt 


ggg 


gcc 


41 


Leu 


Ala 


Leu 


Ala 


Gly 


Leu 


Gly 


Ser 


Val 


Leu 


Arg 


Ala 


Gin 


Arg 


Gly 


Ala 


42 






30 










35 










40 








44 


ggg 


gcc 


ggg 


get 


gcc 


gag 


ecg 


gga 


ccc 


ecg 


cgc 


ace 


ecg 


cgc 


ccc 


ggg 


45 


Gly 


Ala 


Gly 


Ala 


Ala 


Glu 


Pro 


Gly 


Pro 


Pro 


Arg 


Thr 


Pro 


Airg 


Pro 


Gly 


46 




45 










50 










55 










48 


egg 


cgc 


gag 


ecg 


gte 


atg 


ecg 


egg 


ecg 


ecg 


gtg 


ecg 


gcg 


aac 


gcg 


ctg 


49 


Arg 


Arg 


Glu 


Pro 


Val 


Met 


Pro 


Arg 


Pro 


Pro 


Val 


Pro 


Ala 


Asn 


Ala 


Leu 


50 


60 










65 










70 










75 


52 


ggc 


gcg 


egg 


ggc 


gag 


gcg 


gtg 


egg 


ctg 


cag 


ctg 


cag 


ggc 


gag 


gag 


ctg 


53 


Gly 


Ala 


Arg 


Gly 


Glu 


Ala 


Val 


Arg 


Leu 


Gin 


Leu 


Gin 


Gly 


Glu 


Glu 


Leu 


54 










80 










85 










90 




56 


egg 


ctg 


cag 


gag 


gag 


age 


gtg 


egg 


ctg 


cac 


cag 


att 


aac 


ate 


tac 


etc 


57 


Arg 


Leu 


Gin 


Glu 


Glu 


Ser 


Val 


Arg 


Leu' 


His 


Gin 


He 


Asn' 


He 


Tyr 


Leu 


58 








95 










100 










105 







209 



257 



305 



353 



401 
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egc 


ate 




ctg 




oy O 


oy t 


ctg 


ccc 


gag 


cgc 


tgg 


aac 


ccg 


» /I Q 


61 Ser 


Asp 


Arg 


He 


Ser 


Leu 


His 


Arg 


Arg 


XJCIX 




fil n 
ox u 


A T^rr 
AX y 


irp 


A G n 
noil 


D'K*r\ 
Jrx \J 




U *i 




110 










lis 


■ 






1 9n 














aaa 


gag 


;a a /T 

aay 


AAA 


Ud L. 


y d L. 


■f- 1- 
Ud t 


gat 


ddt 


■H- rr 

ttg 


ccc 


agg 


aca 


tct 


4y / 


65 Leu 


Cys 


Lys 


Glu 


Lys 


Lys 


TVT* 
Ay i 




T VT 


A Qn 


A en 




Pt*/^ 

cX w 


A T*rr 
r\X y 


Thr* 

X IIX 


Dczx 




Q O 












X J U 










X J 3 














a. 


ata 


gca 


L. L. L- 


tat 


aat* 

CICL L. 


rra a 
y dd 


y\-'V-» 


"f- rt rr 

tgg 


+■ o 
utd 


d t L 


etc 


ttt 


egg 


aca 


^ A 1^ 


69 Val 


He 


He 


Ala 


Phe 


Tvr 


Asn 


Glu 


Ala 


XX 


Ser 


Thr 




XICLl 


A TTT 


Thr 

X XXX 




7 0 1 A n 

/ \J X *x V/ 




















1 

± J u 














/ ^ ^ 1. 1. 


■f* a <^ 


agt gtc 




gag 


a 1^ a 
dWd 




tv^y 


y dt 


dtC 


ctg 


eta 


gaa 


gaa 


gtg 


e; Q "a 


73 Val 


Tvr 

J. Jf X 


Ser 


val 


Leu 


Glu 


Thr 




C J- \J 


A 


Tlo 
X xc 


XIC7U. 


i-lC Li. 


\7X U. 


ox Li 


Va 1 

V dX 




74 








160 










165 










170 
X / u 






76 ate 


ctt 


gta 


gat 


gae 


tac 


a cft 
ay L. 


na +■ 
y d L. 


aYTa 
dy d 


rra rr 
ydg 


r^a r" 


/rf- rr 

Ctg 


a a rr 
ddy 


cf a rr 

gag 


r*rr r* 

ty t 


u uy 


O f* J. 


77 He 


Leu 


Val 


Asp 


Asp 


Tvr 


Ser 


Asp 


Arg 


Glu 


His 


Leu 


Lyis 


Glu 


Arg 


Leu 




78 






175 










X O V/ 










A.OD 








80 gcc 


aat 


gag 


ctt 


uc.g 


gga 


o L.y 


r> 


a rr 
ddy 


gtg 


ty t 


t uy 


ate 


cgc 


gee 


aac 


0 o y 


81 Ala 


Asn 


Glu 


Leu 


Ser 


Glv 


Leu 


Pro 




Val 


Axy 


XICU. 


X xc 


A T'rr 
AXg 


A1 a 


A on 
noli 




82 




190 










X 7 ^ 










900 
z w u 










84 aag 


aga 


gag 


ggc 


^ L.y 


g.ug 


r»/Ta 
oy d 


y 


O rr rr 

egg 


Ctg 


Ctg 


ggg 


geg 


tct 


gcg 


gcg 


7 7 


85 Lys 


Arg 


Glu Gly 


Leu 


Val 


Arg 


Ala 


Arg 


Leu 




fil V 
VJX jr 


Ala 


OCX 


A1 a 

AXu. 


A1 a 

nXd 




86 


205 










210 










91 S 

Z X ^ 












88 agg 


ggc 


gat gtt 








uy 


y av«o 


uy L. 


oa 


uy u 


gag 


tyU 


CdC 


gaa 


7fl ^ 


89 Arg Gly 


Asp . Val 


Leu 


Thr 


Phe 


XICU 




PvQ 


His 




ox LL 


PvQ 


Hi q 


ox u. 




90 220 










99 S 
^ ^ .J 










z o u , 










Z O D 




92 ggg 


tgg 


ctg 


gag 


y 


Ctg 


uy 


L«dy 


agg 


ate 


Cdt 


gaa 


gag 


gag 


teg 


gca 




93 Gly Trp 


Leu 


Glu 


Pro 


Leu 


Leu 


Gin 

v7X il 


Arg 


Tie 
X xc 


nxo 


V3XU 


ox Li 


ox Li 


Q Q 
OCX 


Al a 

rVXd 




94 


















9 A ^ 

Z fl: 










9^0 
Z J u 






96 gtg 


gtg 


tgc 


ccg 


y ug 


a L. L. 


y d L. 


gtg 


a -hn 
d ut 


rrsa r* 
y.d t 


tgg 


a a /-I 
ddt 


ace 


ttc 


gaa 


tdC 


oo± 


97 Val 


Val 


Cys 


Pro 


Val 


He 


Asp 


Val 


He 


Asp 


xxp 


Asn 


Thr 


Phe 


ox u. 






98 






255 










260 










9fi R 
z u ^ 








100 ctg 


ggg 


aac 


tec 


rrrrrr 


yag 




cag 


ate 


rr rr r> 

ggc 


rrrT"H 

ggt 


u u t 


gae 


tgg 


agg 


Ctg 


QOQ 


101 Leu 


Gly 


Asn 


Ser 


Glv 


Glu 


c ±. \J 


Gin 


He 


o X jr 


fil V 


ir lie 


A cn 




'A T*rr 
r\X y 






102 




270 










275 










z ou 










104 gtg 


ttc 


acg 


tgg 


cac 


aca 


y t.t- 


cct 


gag 


a cr rr 

dgg 


rr IT 

gdg 


o^gg 


a -ha 
d ud 


egg 


d uy 


caa 


Q77 

y / / 


105 Val 


Phe 


Thr 


Trp 


His 


Thr 


Val 


Pro 


Glu 


Arg 


Glu 


Arg 


He 


Arg 


Met 


ox ii 




106 . 


285 










290 










9Q s 
z 7 ^ 












108 tec 


ccc 


gtc 


gat 


gtc 




a crcr 

ayy 


tet 


eca 


a r<a 


d uy 


get 


gg^ 


rrrrrr 

ggg 


o rr 

Ctg 


ttt 


1 09*^ 
±uZ 3 


109 Ser 


Pro 


Val 


Asp 


Val 


He 


Arg 


Ser 


Pro 


Thr 


Met 


Ala 


Gly 


Gly 


Leu 


Phe 




110 300 










305 










310 










315 




112 get 


gtg 


agt 


aag 


aaa 


tat 


ttt 


gaa tat 


ctg 


ggg 


tet 


tat 


gat 


aca 


gga 


1073 


113 Ala 


Val 


Ser 


Lys 


Lys 


Tyr 


Phe 


Glu 


Tyr 


Leu 


Gly 


Ser 


Tyr 


Asp 


Thr 


Gly 




114 








320 










'325 










330 






116 atg 


gaa 


gtt 


tgg 


gga 


gga 


gaa 


aac 


etc 


gaa 


ttt 


tec 


ttt 


agg 


ate 


tgg 


1121 


117 Met 


Glu 


Val 


Trp 


Gly 


Gly 


Glu 


Asn 


Leu 


Glu 


Phe 


Ser 


Phe 


Arg 


He 


Trp 




118 






335 










340 










345 








120 cag 


tgt 


ggt 


ggg 


gtt 


ctg 


gaa 


aca 


cac 


eca 


tgt 


tec 


cat 


gtt 


ggc 


cat 


1169 


121 Gin 


Cys 


Gly 


Gly 


val 


Leu 


Glu 


Thr 


His 


Pro 


Cys 


Ser 


His 


val 


Gly 


His 




122 




350 










355 










360 










124 gtt 


ttc 


ccc 


aag 


caa 


get 


ccc 


tac 


tec 


cgc 


aac 


aag 


get 


ctg 


gcc 


aac 


1217 
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125 


Val 


Phe 


Pro 


Lys 


Gin 


Ala 


Pro 


TVT 


Ser 


Aro 


Asn 


Lys 


Ala 




Al a 
Axa 


A cin 
noil 




126 




365 










370 




























^y ^ 


y vci 


you 


rr A a 

y 


fT+* a 

y u^ 


i-yy 


ci cy 


' era +• 
y a L. 


era a 
y aa 


U W L. 


aaa 


rra rr 

gag 


etc 


-h A r* 
taw 




.129 


Sex* 


Val 


Arg 


Ala 


Ala 




Va 1 


lip 






O X Ll 


It lie . 


ijjro 


^^1 n 


T.A11 


lyr 




130 


380 










385 










390 










J ^ mj 






+* a 






a a o 






y v>(^ 


L#y 


tug 


era a 
y aa 




L. L. L. 


yyy 


era t* 
y at 


^'tg 


aca 


J. J J. J 


133^ 


Tvr 
A jf: J. 


His 


Aro 


Asn 


Pro 


Arg 


Ala 


Aiy 


Leu 


Glu 


Pro 


Phe 




AO ^ 


Va 1 
V ax 


± ilX 




JL J *± 










400 

*r V U 










40S 










410 


• 




J. U 




^y y 


ciciy 


^ciy 




egg 


y uU* 


A A 

day 




^^ciy 


ugt 


aaa 


gac 


ttw 


aag 


tgg 




137 


Glu 


Arcr 


Lys 


Gin 


Leu 


Arg 


Asp 


Lys 


Leu 


Gin 


Cys 


Lys 


A Qrj 

AO ^ 


Phe 


T.VQ 
Jj jr o 


ixp 




138 








415 










490 

*± ^ v 










4b J 








140 


ttc 


ttg 


era cr 
y ay 


act 


y uy 


(..Cl L. 


ppa 


era a 
y cici 


L.y 


na "t" 


er't" er 

y '-y 


WW u 


era er 

y ^y 


rr^ r^ 
yaw 


a rrrr 

agg 


r^ r^+" 

W W U 


1 40Q 


141 


Phe 


Leu 


Glu 


Thr 


Val 


Tvr 
J. J. 


Pro 


Glu 


Leu 


His 


Val 


Pro 


Glu 


Asp 


Arg 






142 






430 










435 










440 










144 


y y ^ 


ttc 


ttc 


y y y 


atg 


etc 


cag 


aac 


aaa 


trera 
y y « 


eta 


aca 


era e* 
yaw 


L.a w 


uy w 


U U L. 


1 4S7 


145 


Gly 


Phe 


Phe 


Glv 


Met 


Leu 


Gin 


Asn 


Lys 


Glv 


Leu 


Thr 


Asp 


Tvr 

J, _y X 


Cys 


Phe 




146 




445 










450 










4SS 

*t fj -J 












148 


gac 


tat 


a a p 






era ^ 
y ci u 


craa 
y cLci 


a a o 


oa n* 
way 


a u u 


gtg 


rrrra 

yya 


e«a r* 
wa w 


oa er 
way 


gtc 


a w L. 


1 SOS 


149 


Asp 


Tyr 


Asn 


Pro 


PrO" 


' Asp 


Glu 


Asn 


Gin 


He 


Val 


Glv 


His 


Gin 


Val 


He 




150 


460 










t O J 










470 










47S 




152 


ctg 


tac 


etc 


tgt 


cat 


y y y 


atg 


crerr* 

yy ^ 


pa er 
v^ay 


aat 


oa cj 
way 


L. L> l» 


L. L>W 


era er 

gcig 


a e^ 
L.aw 


a e^er 
awy 


1 SS'^ 


153 


Leu 


Tyr 


Leu 


Cys 


His 


Glv 


Met 


Glv 


Gin 


Asn 


Gin 


Phe 


Phe 


Glu 


Tvr 


Thr 




154 










480 










485 










4Q0 

*x U 






156 


tec 


cag 


CI CI CI 


y cici 


a "t" a 


Vrt^y v.' 




a a r» 


a 


oa r* 


oa rr 
way 


WW U 


rr a rr 

gag 


ggc 


rro 

wy w 


al-l- 
a L. w 




157 


Ser 


Gin 


Lys 


Glu 


lie 


Arcr 
nxy 


Tvr 


Asn 


Thr 


His 


Gin 


Pro 


Glu 


Glv 


Cys 


He 




158 








495 










500 

-J V/ V/ 










SOS 








160 


get gtg 


gaa 


y i^cL 


yya 


a "t" cr 
ci L.y 


era 1~ 
y cL I. 






a 


ai-CT 
a uy 


oa 
wa u 


w uw 


+■ err^ 
wy w 


rr A A 
y aa 


rr^ A 
y r*-^ 


1 fi4Q 
X D ft ;? 


161 


Ala 


Val 


Glu 


Ala 


Glv 


Met 


Asp 


Thr 


Leu 


He 


Met 


His 


Leu 


Cys 


Glu 


Glu 




162 






510 

«J JL v/ 










SI s 










520 










164 

J. u *t 


act 


gcc 




rfarr 
yay 


aa ^ 


oa o 
^ay 


a arr 




a-hn 
a 




r*si rr 
way 


gag 


gat 


gga 


tct 


tta 


1 fiQ7 


165 


Thr 


Ala 


Pro 


Glu 


Asn 


Gin 


Lys 


Phe 


He 


Leu 


Gin 


Glu 


Asp 


Gly 


Ser 


Leu 




166 




525 










530 










535 












168 


ttt 


cac 


gaa 


cag 


tec 


aag 


aaa 


tgt 


gtc 


cag 


get 


geg 


agg 


aag 


gag 


teg 


1745 


169 


Phe 


His 


Glu 


Gin 


Ser 


Lys 


Lys 


Cys 


val 


Gin 


Ala 


Ala 


Arg 


Lys 


Glu 


Ser 




170 


540 










545 










550 










555 




172 


agt 


gac 


agt 


ttc 


gtt 


cea 


etc 


tta 


ega 


gac 


tge 


ace 


aac 


teg 


gat 


eat 


1793 


173 


Ser 


Asp 


Ser 


Phe 


Val 


Pro 


Leu 


Leu 


Arg 


Asp 


Cys 


Thr 


Asn 


Ser 


Asp 


His ■ 




174 










560 










565 










570 






176 


cag 


aaa 


tgg 


ttc 


ttc 


aaa 


gag 


cgc 


atg 


tta 


tga 


agcctcgtgt atcaaggagc 


1846 


177 


Gin 


Lys 


Trp 


Phe 


Phe 


Lys 


Glu 


Arg 


Met 


Leu 


* 














178 








575 










580 


















180 


ccatcgaagg agactgtgga gccaggactc tgcccaacaa 


agacttaget aageagtgac, 1906 


181 


cagaacccac caaaaactag gctgcattgc tttgaagagg 


caatcatttt geeatttgtg 


1966 


182 


aaagttgtgt tggatttagt aaaaatgtga ataagctttg 


tacttatttt gagaaetttt 


2026 


183 


taaatgttcc aaaataccct attttcaaag ggtaatcgta 


agatgttaae ecttggtatt 


2086- 


184 


tagaaaatta aaaccttata atatttttct atcaarawrw 


awattttaca gtcgtgcctt 


2146 


185 


ttactctcat tagcaaaaaa gataaagatt ttattttggt 


atttacaaga attcccaggt 


2206 


186 


acgaagatat ctgcatgggt ggaaatcagg ttcaagcaac 


gtaetttgea ttaactgata 


2266 


187 


atacctcagc tgcggggtta aagttttccc agtatagaga 


gactgtcact aggaacattg 2326 
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188 tattgattta ttcaggtcat tgagatcttc tagatgtatt ttaaaaagaa tgctttttgg 2386 

189 ttatgtgttg ctaccacagt taacactcca taatgttcat gtcagccaaa gaggactaac 2446 

190 caaagctgaa atctcagaga acaatttgct ttactaagct gagtcaactt gagagcgaac 2506 

191 ttctaacaat gccgcactgt agtgtggctg gttctaccac tatgacttta aaacatgttt 2566 

192 atatcatttt taatttttat gatacggtag tgtcagggag aaatgtaatg ttctatatga 2626 

193 aattcctttt tcaagtttgt tcattaataa cagttattaa tttaaatcag cgttagagtt 2686 

194 tgtgctgctg caactgctgt gaaaatttct ctgagtaatt ctgatttgtg aatgatccca 2746 

195 gaccaaccct gagattttgt caacctgatt aagtcaatat gaatgattaa aaagatgtga 2806 

196 gaacaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaa 2850 

198 <210> SEQ ID NO: 2 

199 <211> LENGTH: 581 

200 <212> TYPE: PRT 

201 <213> ORGANISM: homo sapiens 
203 <400> SEQUENCE: 2 



204 


Met 


Tro 


Glv 


Arcr 


Thr 


Ala 


Arg 


Arg 


Arg 


Cys 


XT X W 


Arg 


Glu 


Leu 


Arg Arg 


205 


1 








5 










10 










15 


206 


Glv 




Glu 


Ala 


Leu 


Leu 


Val 


Leu 




Ala 


JJCU 


XICSU 


Ala 


Leu Ala Gly 


207 








20 










25 










30 




208 


Leu' 


Glv 


Ser 


Val 


Leu 


Arcr 


Ala 


Gin 


Arg 


Gl v 


Ala 


Glv 


Ala 


Gly Ala Ala 


209 






35 










40 










45 






210 


Glu 


Pro 


Glv 


Pro 


Pro 


Ara 


Thr 


Pro 


Arg 


Pro 


Glv 


Arg 


Arg 


Glu 


Pro Val 


211 




50 










55 










60 








212 


Met 


Pro 


Arcr 


Pro 


Pro 


Val 


Pro 


Ala 


Asn 


Ala 


Leu 


Glv 


Ala 


Arg 


Gly Glu 


213 


65 










70 










75 








80 


214 


Ala 


Val 


Arg 


Leu 


Gin 


Leu 


Gin 


Gly 


Glu 


Glu 


Leu 


Arg 


Leu 


Gin 


Glu Glu 


215 










85 










90 










95 


216 


Ser 


Val 


Arg 


Leu 


His 


Gin 


He 


Asn 


He 


Tyr 


Leu 


Ser 


Asp 


Arg 


He Ser 


217 








100 










105 










110 




218 


Leu 


His 


Arg 


Arg 


Leu 


Pro 


Glu 


Arg 


Trp 


Asn 


Pro 


Leu 


Cys 


Lys 


Glu Lys 


219 






115 










120 










125 






220 


Lys 


Tyr 


Asp 


Tyr 


Asp 


Asn 


Leu 


Pro 


Arg 


Thr 


Ser 


Val 


He 


He 


Ala Phe 


221 




130 










135 










140 








222 


Tyr 


Asn 


Glu 


Ala 


Trp 


Ser 


Thr 


Leu 


Leu 


Arg 


Thr 


Val 


Tyr 


Ser 


Val Leu 


223 


145 










150 










155 








160 


224 


Glu 


Thr 


Ser 


Pro 


Asp 


He 


Leu 


Leu 


Glu 


Glu 


Val 


He 


Leu 


Val^ 


Asp Asp 


225 










165 










170 










175 


226 


Tyr 


Ser 


Asp 


Arg 


Glu 


His 


Leu 


Lys 


Glu 


Arg 


Leu 


Ala 


Asn 


Glu 


Leu Ser 


227 








180 










185 










190 




228 


Gly 


Leu 


Pro 


Lys 


Val 


Arg 


Leu 


He 


Arg 


Ala 


Asn 


Lys 


Arg Glu Gly Leu 


229 






195 










200 










205 






230 


Val 


Arg 


Ala 


Arg 


Leu 


Leu 


Gly 


Ala 


Ser 


Ala 


Ala 


Arg 


Gly Asp Val Leu 


231 




210 










215 










220 








232 


Thr 


Phe 


Leu 


Asp 


Cys 


His 


Cys 


Glu 


Cys 


His 


Glu 


Gly 


Trp 


Leu 


Glu Pro 


233 


225 










230 










235 








240. 


234 


Leu 


Leu 


Gin 


Arg 


He 


His 


Glu 


Glu 


Glu 


Ser 


Ala 


Val 


Val 


Cys 


Pro Val 


235 










245 










250 










255 


236 


He 


Asp 


Val 


He 


Asp 


Trp 


Asn 


Thr 


Phe 


Glu 


Tyr 


Leu 


Gly Asn 


Ser Gly 


237 








260 










265 










270 




238 


Glu 


Pro 


Gin 


He 


Gly 


Gly 


Phe 


Asp 


Trp 


Arg 


Leu 


Val 


Phe 


Thr 


Trp His 
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RAW SEQUENCE LISTING DATE: 03/12/2002 

PATENT APPLICATION: US/IO/074 , 527 TIME: 17:47:57 



Input Set : . A 8 \pto . ante . txt 

Output Set: N:\CRF3\03122002\J074527 /raw 



23 9 






275 










280 










9 ft s 
^ 0 ~) 








240 


Thr 


Val 


Pro 


Glu Arg 


VJX Li 


a T*rr 

A J. ^ 


He 


Arg 


ric u 


Gl n 


OCX 


fL\J 


V aX 


Asp 


\7a 1 

vax 


241 




290 










295 




















242 


He 


Arg 




Pro 


Thr 




Ala 


Gly Gly 




Pho 
IT 1 lei 




Vax 


Ser 


Lys 


Lys 


243 


305 










310 










315 










0 z u 


244 


Tyr .Phe 


Glu 


Tyr 


Leu 


Glv 


Ser 


Tyr Asp 




G1 V 

v3 X jr 




Gl 11 


V GLX 


irp 


/3l X7 

vjxy 


245 










325 










330 










•D J 




246 


Gly Glu 


Asn 


Leu 


Glu 


IT iXKi 


OCX 


Phe 


Arg 


Tip 


irp 


Gl n 




Gl \7 


P 1 


\7a 1 

vax 


247 








340 










345 
















248 


Leu 


Glu 


Thr 


His 


Pro 






His 


Val 


Gl V 


xiXo 


V dx 


XT lie 


Pro 


Lys 


v^xn 


249 






355 










360 










:^65 

•J U .J 








250 


Ala 


Pro 


lyr 


Ser Arg 


Aon 


T.T7 e 


Ala 


Leu 


& 1 A 


Asn 


oex 


Vax 


Arg 


Ala 

Axa 


Ala 

Axa 


251 




370 










375 










•J 0 V 










252 


Glu 


val 




Met 


Asp 


m 11 


It 11c 


Lys 


Glu 


XlCU 


lyr 


lyr 




Arg 


Asn 


Pro 


253 


385 










390 










0 27 ^ 










Ann 

fi U U 


254 


Arg 


Ala 


Arg 


Leu 


Glu 


IT A.\J 


ir lie 


Gly Asp 


Va 1 
V d J. 


T V» Y* 
J. iXX 


G1 n 
ox U 


AX ^ 


Lys 




Leu 


255 










405 










t X y 










ft xD 




256 


Arg 


Asp 


Lys 


Leu 


Gin 


PVQ 


TiVG 
XJ jr 3 


Asp 


Phe 




irp 




IjcU. 




Thr 


Va 1 
VaX 


257 








420 










425 










4 J u 






258 


Tyr 


Pro 


Glu 


Leu 


His 


V a X 


XT X ^ 


Glu 


Asp 


AX ^ 


PT*r\ 
irxLI 


G1 V 
oxy 


Pho 


flic 


ni \r 
oxy 


JncL. 


259 






435 










440 










*± •* J 








260 


Leu 


Gin 


Asn 


Lys 


Gly 


Leu 


Thr 


Asp 


Tyr 




PhP 

IT IXC 


A or* 


lyr 


Aon 

AOXl 


irx U 


fx 0 


261 




450 










455 










*± U VJ 










262 


Asp 


Glu 


Asn 


Gin 


He 


Val 


Gly 


His 


Gin 


V a. X 


X xc? 


XjcU 


lyr 


JjcU, 


v-^y 0 


nxo 


263 


465 










470 










475 










* 0 v 


264 


Gly Met 


Glv 


Gin 


Asn 


Gin 


Php 


Phe 


Glu 


lyx 


X liX 


OCX 


Gl n 


jjy 0 


Gl n 


Tl 0 

XXc 


265 










485 










490 










495 




266 


Arg 


Tyr 


Asn 


Thr 


His 


Gin 


Pro 


Glu 


Gly 


Cys 


He 


Ala 


Val 


Glu 


Ala 


Gly 


267 








500 










505 










510 






268 


Met 


Asp 


Thr 


Leu 


He 


Met 


His 


Leu 


Cys 


Glu 


Glu 


Thr 


Ala 


Pro 


Glu 


Asn 


269 






515 










520 










525 








270 


Gin 


Lys 


Phe 


He 


Leu 


Gin 


Glu 


Asp Gly 


Ser 


Leu 


Phe 


His 


Glu 


Gin 


Ser 


271 




530 










535 










540 










272 


Lys 


Lys 


Cys 


Val 


Gin 


Ala 


Ala 


Arg 


Lys 


Glu 


Ser 


Ser 


Asp 


Ser 


Phe 


Val 


273 


545 










550 










555 










560 


274 


Pro 


Leu 


Leu 


Arg 


Asp 


Cys 


Thr 


Asn 


Ser 


Asp 


His 


Gin 


Lys 


Trp 


Phe 


Phe 


275 










565 










570 










575 




276 


Lys Glu 


Arg 


Met 


Leu 

























277 580 

280 <210> SEQ ID NO: 3 

281 <211> LENGTH: 1746 

282 <212> TYPE: DNA 

283 <213> ORGANISM: homo sapiens 

285 <220> FEATURE: 

286 <221> NAME/KEY: CDS 

287 <222> LOCATION: (1)...(1746) 

289 <400> SEQUENCE: 3 

290 atg tgg ggg cgc acg gcg egg egg cgc tge eeg egg gaa ctg egg egc 48 

291 Met Trp Gly Arg Thr Ala Arg Arg Arg Cys Pro Arg Glu Leu Arg Arg 
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RAW SEQUENCE LISTING ERROR SUMMARY DATE: 03/12/2002 

PATENT APPLICATION: US/10/074,527 TIME: 17:47:58 

Input Set : A5\pto.a1nc.txt 

Output Set: N:\CRF3\03122002\J074527.raw 

Please Note; 

Use of n and/or Xaa have been detected In the Sequence Listing. Please review the 
Sequence Listing to ensure that a corresponding explwation is presented in the <220> 
to <223> fields of each sequence which presents at least one n or Xaa. 

Seq#:9; Xaa Pes. 2,3,4,5,6,7,9,10,11,12,13,14,16,17,18,19,20,21 
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OIPE 



4 <110> 

5 ^ 
6 
7 

9 <120:i' 

10 

12 <130> 
-V 14 <140> 



14 

14 



<141> 

<150> 
15 <151> 
17 .<160> 
19 <170> 



RAW SEQUENCE LISTING 

PATENT APPLICATION: US/10/074> 527 



DATE: 02/28/2002 
TIME: 10:02:19 



Input Set : A:\sequence listing .txt 
Output Set: N:\CRF3\02282002\J074527.raw 



Peter J. 



APPLICANT: Olandt, 
Meyers, Rachel E. 
Galvin, Katherine A. 
Millennium Pharmaceuticals Inc. 

TITLE OP INVENTION: 33945, A Human Glycosyltransferase and 

Uses Therefor 

FILE REFERENCE: MPI2001 - 018P1RCP1 (M) ; * 
CURRENT APPLICATION NUMBER: US/10/074 ^ 527; 
CURRENT FILING DATE: 2002-02-12 

PRIOR APPLICATION NUMBER: 60/2 69202 

PRIOR FILING DATE: 2001-02-15 ■ 
NUMBER OF SEQ ID NOS : - 9 

SOFTWARE : FastSEQ for Windows Version 4.0 



Does Not Comply 
Corrected Diskette Needec 



tfff 



ERRORED SEQUENCES 

. 649 <210> SEQ ID NO: .9 . - . - 

650 <211> LENGTH:. 22 " . ' : • 
,651 <212>. TYPE: PRT 

652 <213> ORGANISM: Artificial Sequence 

654 <220> FEATURE: ' ' 

655 <223> OTHER INFORMATION: consensus. 
S51 <221> NAME/KEY: VARIANT 

658 <222> LOCATION: (1) ... (22) ' * . . 

. ' ' 659 <223> OTHER INFORMATION: Xaa = any amino acid 

-. 661 <400> SEQUENCE: 9. . . 

W--> 662 Leu Xaa Xaa Xaa Xaa/ Xaa Xaa Leu. Xaa Xaa Xaa Xaa Xaa Xaa Leu Xaa 

663 1 '5 10 , . 15 

W--'> 664 Xaa Xaa Xaa Xaa Xaa Leu 

665 ^^^..^ 20 
E--> 667 /- 1 -J 
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VERIFICATION SUMMARY DATE: 02/28/2002 

PATENT APPLICATION: US/10/074,527 TIME: 10:02:20 

Input Set : A:\seguence listing.txt 
Output Set: N:\CRF3\02282002\J074527.raw 

L:14 M:270 C: Current Application Number differs, Replaced Current Application No 

L:14 M:271 C: Current Filing Date differs, Replaced Current Filing Date 

L:662 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 9 

L:664 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 9 

L:667 M:332 E: (32) Invalid/Missing Amino Acid Numbering, SEQ ID: 9 
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